CRYSTALLIZABLE COMPOSITIONS COMPRISING A HEPATITIS C VIRUS NS3 PROTEASE
DOMAIN/NS4A COMPLEX AND CRYSTALS THEREBY OBTAINED

TECHNICAL FIELD OF INVENTION

The present invention relates to compositions
and crystals of a hepatitis C virus protease in complex
with its viral cofactor. This invention also relates
to methods of using the structure coordinates of
hepatitis C virus protease in complex with a synthetic
NS4A to solve the structure of similar or homologous
proteins or protein complexes.

BACKGROUND OF THE INVENTION

Infection by hepatitis C virus (HCV) is a
compelling human medical problem. HCV is recognized as
the causative agent for most cases of non-A, non-B
hepatitis, with an estimated human seroprevalence of 1%
globally [Choo, Q.-L. et al., "Isolation of a cDNA
Clone Derived From a Blood-Borne Non-A, Non-B Viral
Hepatitis Genome", Science, 244, pp. 359-362 (1989);
Kuo, G. et al., "An Assay for Circulating Antibodies to
a Major Etiologic Virus of Human Non-A, Non-B
Hepatitis", Science, 244, pp. 362-364 (1989); Purcell,
R.H., "Hepatitis C virus: Historical perspective and
current concepts", FEMS Microbiology Reviews, 14, pp.
181-192 (1994); Van der Poel, C.L., "Hepatitis C Virus.
Epidemiology, Transmission and Prevention", in Hepatitis
C Virus. Current Studies in Hematology and Blood
Transfusion, H.W. Reesink, Ed., (Basel: Karger), pp.
137-163 (1994)]. Four million individuals may be
infected in the United States alone [Alter, M.J. and
Mast, E.E., "The Epidemiology of Viral Hepatitis in
the United States", Gastroenterol. Clin. North Am., 23,
pp. 437-455 (1994)].

Upon first exposure to HCV only about 20% of
infected individuals develop acute clinical hepatitis
while others appear to resolve the infection
spontaneously. In most instances, however, the virus
establishes a chronic infection that persists for
decades [Iwarson, S., "The Natural Course of Chronic
Hepatitis", FEMS Microbiology Reviews, 14, pp. 201-204
(1994)]. This usually results in recurrent and
progressively worsening liver inflammation, which often
leads to more severe disease states such as cirrhosis
and hepatocellular carcinoma [Kew, M.C., "Hepatitis C
and Hepatocellular Carcinoma", FEMS Microbiology
Reviews, 14, pp. 211-220 (1994); Saito, I. et al.,
"Hepatitis C Virus Infection is Associated with the
Development of Hepatocellular Carcinoma", Proc. Natl.
Acad. Sci. USA, 87, pp. 6547-6549 (1990)]. Currently,
there are no broadly effective treatments for the
debilitating progression of chronic HCV.

The HCV genome encodes a polyprotein of 3010-3033
amino acids (Figure 1) [Choo, Q.-L. et al.,
"Genetic Organization and Diversity of the Hepatitis C
Virus", Proc. Natl. Acad. Sci. USA, 88, pp. 2451-2455
(1991); Kato, N. et al., "Molecular Cloning of the Human
Hepatitis C Virus Genome From Japanese Patients with
Non-A, Non-B Hepatitis", Proc. Natl. Acad. Sci. USA,
87, pp. 9524-9528 (1990); Takamizawa, A. et al.,
"Structure and Organization of the Hepatitis C Virus
Genome Isolated From Human Carriers", J. Virol., 65,
pp. 1105-1113 (1991)]. The HCV nonstructural (NS)
proteins provide catalytic machinery for viral
replication. The NS proteins are derived by
proteolytic cleavage of the polyprotein
[Bartenschlager, R. et al., "Nonstructural Protein 3 of
the Hepatitis C Virus Encodes a Serine-Type Proteinase
Required for Cleavage at the NS3/4 and NS4/5
Junctions", J. Virol., 67, pp. 3835-3844 (1993);
Grakoui, A. et al., "Characterization of the Hepatitis C
Virus-Encoded Serine Proteinase: Determination of
Proteinase-Dependent Polyprotein Cleavage Sites", J.
Virol., 67, pp. 2832-2843 (1993); Grakoui, A. et al.,
"Expression and Identification of Hepatitis C Virus
Polyprotein Cleavage Products", J. Virol., 67, pp.
1385-1395 (1993); Tomei, L. et al., "NS3 is a serine
protease required for processing of hepatitis C virus
polyprotein", J. Virol., 67, pp. 4017-4026 (1993)].

The HCV NS protein 3 (NS3) contains a serine
protease activity that helps process the majority of
the viral enzymes, and is thus considered essential for
viral replication and infectivity. It is known that
mutations in the yellow fever virus NS3 protease
decrease viral infectivity [Chambers, T.J. et al.,
"Evidence that the N-terminal Domain of Nonstructural
Protein NS3 From Yellow Fever Virus is a Serine
Protease Responsible for Site-Specific Cleavages in the
Viral Polyprotein", Proc. Natl. Acad. Sci. USA, 87, pp.
8898-8902 (1990)]. The first 181 amino acids of NS3
(residues 1027-1207 of the viral polyprotein) have been
shown to contain the serine protease domain of NS3 that
processes all four downstream sites of the HCV
polyprotein (Figure 1) [Lin, C. et al., "Hepatitis C
Virus NS3 Serine Proteinase: Trans-Cleavage
Requirements and Processing Kinetics", J. Virol., 68,
pp. 8147-8157 (1994)].

NS3 is associated with a cofactor, NS4A.
NS4A seems critical to the activity of NS3, enhancing
the proteolytic efficiency of NS3 at all of the
cleavage sites. NS4A is a 54 residue amphipathic
peptide, with a hydrophobic N-terminus and a
hydrophilic C-terminus [Failla, C. et al., "Both NS3
and NS4A are Required for Proteolytic Processing of
Hepatitis C Virus Nonstructural Proteins", J. Virol.,
68, pp. 3753-3760 (1994)]. Its function appears
complex, possibly assisting in the membrane
localization of NS3 and other viral replicase
components [Lin, C. et al., "A Central Region in the
Hepatitis C Virus NS4A Protein Allows Formation of an
Active NS3-NS4A Serine Proteinase Complex In Vivo and
In Vitro", J. Virol., 69, pp. 4373-4380 (1995);
Shimizu, Y. et al., "Identification of the Sequence on
NS4A Required for Enhanced Cleavage of the NS5A/5B Site
by Hepatitis C Virus NS3 Protease", J. Virol., 70, pp.
127-132 (1996); Tanji, Y. et al., "Hepatitis C Virus-
Encoded Nonstructural Protein NS4A has Versatile
Functions in Viral Protein Processing", J. Virol., 69,
pp. 1575-1581 (1995)], but its best characterized
function is that of a cofactor for the NS3 protease.

The current understanding of HCV has not led
to satisfactory treatments for HCV infection. The
prospects for effective anti-HCV vaccines remain
uncertain. The only established therapy for HCV
disease is interferon treatment. However, interferons
have significant side effects [Janssen, H.L.A. et
al., "Suicide Associated with Alfa-Interferon Therapy
for Chronic Viral Hepatitis", J. Hepatol., 21, pp. 241-
243 (1994); Renault, P.F. and Hoofnagle, J.H., "Side
effects of alpha interferon", Seminars in Liver Disease,
9, pp. 273-277 (1989)] and induce long term remission in
only a fraction (~25%) of cases [Weiland, O.,
"Interferon Therapy in Chronic Hepatitis C Virus
Infection", FEMS Microbiol. Rev., 14, pp. 279-288
(1994)]. Thus, there is a need for more effective
anti-HCV therapies.

The NS3 protease is considered a potential
target for antiviral agents. However, drug discovery
efforts directed towards the NS3 protein have been
hampered by the lack of structural information about
NS3 and its complex with NS4A. Such structural
information would be valuable in the
discovery of HCV NS3 protease inhibitors. However,
efforts to determine the structure of HCV NS3 protease
have been hampered by difficulties in obtaining
sufficient quantities of pure active enzyme
[Steinkuhler, C. et al., "In Vitro Activity of
Hepatitis C Virus Protease NS3 Purified from
Recombinant Baculovirus-Infected Sf9 Cells", J. Biol.
Chem., 271, pp. 6367-6373 (1996)]. There have been no
crystals reported of any NS3 or NS3 protease domain
protein. Thus, x-ray crystallographic analysis of such
proteins has not been possible.

SUMMARY OF THE INVENTION

Applicants have solved this problem by
providing, for the first time, compositions comprising
a hepatitis C virus (HCV) NS3 protease-like polypeptide
complexed with a NS4A-like peptide and methods for
making such compositions.

The invention also provides crystals of a HCV
NS3 protease-like polypeptide/NS4A-like peptide complex
and methods for making such crystals.

The invention also provides the structure
coordinates of a HCV NS3 protease-like
polypeptide/NS4A-like peptide complex.

The invention also provides a method for
determining at least a portion of the three-dimensional
structure of molecules or molecular complexes which
contain at least some structurally similar features to
a HCV NS3 serine protease domain.






BRIEF DESCRIPTION OF THE FIGURES 



Figure 1 depicts HCV polyprotein processing.
The locations of the HCV structural and nonstructural
proteins are marked on a diagram of the 3011 amino acid
polypeptide. Cleavages between the structural proteins
by cellular signal peptidases are marked by asterisks.
Cleavage between NS2 and NS3 is mediated by the NS2/NS3
metallo-protease. The NS3 serine protease is
responsible for cleavages between NS3 and NS4A, NS4A
and NS4B, NS4B and NS5A, and NS5A and NS5B.

Figure 2 depicts stereo ribbon diagrams of
the NS3/NS4A complex. The view is into the active site
cleft of the enzyme. Side-chains of active site
residues His-1083, Asp-1107, and Ser-1165, along with
Zn++ ligands Cys-1123, Cys-1125, and Cys-1171 are
displayed in ball-and-stick representation. Zn++, its
H2O ligand, and the β-strand formed by NS4A are also
shown.



Figure 3 lists the atomic structure
coordinates for hepatitis C virus recombinant,
truncated nonstructural protein 3 (hereafter referred
to as tNS3) in complex with a synthetic peptide of the
central region of the nonstructural protein 4A
(hereafter referred to as sNS4A) as derived by X-ray
diffraction from crystals of that complex (hereafter
referred to as tNS3/sNS4A). The preparation of the
complex is described in Examples 1 and 2. The
following abbreviations are used in Figure 3:

"Atom type" refers to the element whose
coordinates have been determined. Elements are defined
by the first letter in the column except for zinc which
is defined by the letters "Zn".

"X, Y, Z" crystallographically define the
atomic position determined for each atom.

"B" is a thermal factor that measures
movement of the atom around its atomic center.

"Occ" is an occupancy factor that refers to
the fraction of the molecules in which each atom
occupies the position specified by the coordinates. A
value of "1" indicates that each atom has the same
conformation, i.e., the same position, in all molecules
of the crystal.
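
As an illustration of how a program might read the columns
described above, the following Python sketch parses a single
coordinate record. The whitespace-delimited layout, the field
order (atom type, X, Y, Z, Occ, B), the names AtomRecord and
parse_record, and the sample values are assumptions made for
this example only; they are not taken from Figure 3.

```python
from dataclasses import dataclass


@dataclass
class AtomRecord:
    element: str   # element symbol derived from the "Atom type" column
    x: float       # X, Y, Z: atomic position
    y: float
    z: float
    occ: float     # occupancy factor (1.0 = same position in all molecules)
    b: float       # thermal factor


def element_from_atom_type(atom_type: str) -> str:
    # Zinc is identified by the two letters "Zn"; every other
    # element is identified by the first letter of the column.
    if atom_type.upper().startswith("ZN"):
        return "Zn"
    return atom_type[0].upper()


def parse_record(line: str) -> AtomRecord:
    # Hypothetical layout: atom_type X Y Z Occ B, separated by whitespace.
    atom_type, x, y, z, occ, b = line.split()
    return AtomRecord(element_from_atom_type(atom_type),
                      float(x), float(y), float(z), float(occ), float(b))
```

A record such as "CA 12.345 6.789 -1.234 1.00 25.30" (invented
values) would be read as a carbon atom with full occupancy.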






Figure 4 shows a diagram of a system used to
carry out the instructions encoded by the storage
medium of Figures 5 and 6.

Figure 5 shows a cross section of a magnetic
storage medium.

Figure 6 shows a cross section of an
optically-readable data storage medium.

DETAILED DESCRIPTION OF THE INVENTION

The following abbreviations are used
throughout the application:

A = Ala = Alanine          T = Thr = Threonine
V = Val = Valine           C = Cys = Cysteine
L = Leu = Leucine          Y = Tyr = Tyrosine
I = Ile = Isoleucine       N = Asn = Asparagine
P = Pro = Proline          Q = Gln = Glutamine
F = Phe = Phenylalanine    D = Asp = Aspartic Acid
W = Trp = Tryptophan       E = Glu = Glutamic Acid
M = Met = Methionine       K = Lys = Lysine
G = Gly = Glycine          R = Arg = Arginine
S = Ser = Serine           H = His = Histidine

HCV = hepatitis C virus



Additional definitions are set forth in the
specification where necessary.

In order that the invention described herein
may be more fully understood, the following detailed
description is set forth.

Applicants have solved the above problems by
providing, for the first time, crystallizable
compositions comprising a HCV NS3 protease-like
polypeptide in complex with a NS4A-like peptide.

Thus, in one embodiment of this invention is
provided a composition comprising a hepatitis C virus
NS3-like polypeptide in complex with an NS4A-like
peptide.

The HCV NS3-like polypeptide portion of the
complex is any polypeptide which has the serine
protease activity of the naturally occurring HCV NS3
protease, particularly the ability to cleave the HCV
polyprotein. It includes HCV NS3, NS3 protease domain
polypeptides and NS3 protease domain-like polypeptides.

As used herein, the terms "HCV NS3" and "NS3"
refer to the hepatitis C virus nonstructural-3 protein
as defined in Lin, C. et al., "Hepatitis C Virus NS3
Serine Proteinase: Trans-Cleavage Requirements and
Processing Kinetics", J. Virol., 68, pp. 8147-8157
(1994).

The term "NS3 protease domain polypeptide"
refers to a truncated, serine protease portion of NS3
as defined in [Bartenschlager, R. et al.,
"Nonstructural Protein 3 of the Hepatitis C Virus
Encodes a Serine-Type Proteinase Required for Cleavage
at the NS3/4 and NS4/5 Junctions", J. Virol., 67, pp.
3835-3844 (1993); Grakoui, A. et al., "Characterization
of the Hepatitis C Virus-Encoded Serine Proteinase:
Determination of Proteinase-Dependent Polyprotein
Cleavage Sites", J. Virol., 67, pp. 2832-2843 (1993);
Grakoui, A. et al., "Expression and Identification of
Hepatitis C Virus Polyprotein Cleavage Products", J.
Virol., 67, pp. 1385-1395 (1993); Tomei, L. et al.,
"NS3 is a serine protease required for processing of
hepatitis C virus polyprotein", J. Virol., 67, pp.
4017-4026 (1993)]. The disclosure of each of these
documents is herein incorporated by reference.

The term "NS3 protease domain-like
polypeptides" refers to polypeptides that differ from
NS3 protease domain polypeptides by having amino acid
deletions, substitutions, and additions, but which
retain the serine protease activity of NS3.

Preferably, the NS3-like polypeptide in the
compositions of this invention is tNS3, a recombinantly
produced hepatitis C virus protease domain protein that
is prepared as described herein.

The NS4A-like peptide portion of the
compositions of this invention is any peptide or
peptide mimetic that is capable of acting as an NS4A
cofactor for NS3. These include NS4A, peptide
fragments thereof and other peptides that differ from
NS4A by having amino acid deletions, substitutions, and
additions, while retaining the above-described
activity.

As used herein the term "NS4A" refers to the
hepatitis C virus nonstructural protein 4A which acts
as a cofactor for NS3 protease [Failla, C. et al.,
"Both NS3 and NS4A are Required for Proteolytic
Processing of Hepatitis C Virus Nonstructural Proteins",
J. Virol., 68, pp. 3753-3760 (1994); Lin, C. et al.,
"Hepatitis C Virus NS3 Serine Proteinase: Trans-
Cleavage Requirements and Processing Kinetics", J.
Virol., 68, pp. 8147-8157 (1994)].

Preferably, the NS4A-like peptide is sNS4A,
the synthetic peptide H-KKGSVVIVGRIVLSGKPAIIPKK-OH.
This peptide encompasses the essential NS3 protease
domain residues of NS4A.

Both the NS3-like polypeptide and the NS4A-
like peptide may be produced by any well-known method,
including synthetic methods, such as solid phase,
liquid phase and combination solid phase/liquid phase
syntheses; recombinant DNA methods, including cDNA
cloning, optionally combined with site directed
mutagenesis; and/or purification of the natural
products, optionally combined with enzymatic cleavage
methods to produce fragments of naturally occurring NS3
and NS4A.

According to a preferred embodiment, the
compositions of this invention are crystallizable. In
this preferred embodiment all of the preferred choices
for the NS3-like polypeptide and the NS4A-like peptide
are identical to those indicated above.

Advantageously, the crystallizable
compositions provided by this invention are amenable to
x-ray crystallography. Thus, this invention also
provides the three-dimensional structure of an HCV NS3-
like polypeptide/NS4A-like peptide complex,
specifically an HCV tNS3/sNS4A complex, at 2.5 Å
resolution. Importantly, this has provided, for the
first time, information about the shape and structure
of the NS3 protease domain.

The three-dimensional structure of the HCV
tNS3/sNS4A complex of this invention is defined by a
set of structure coordinates as set forth in Figure 3.
The term "structure coordinates" refers to Cartesian
coordinates derived from mathematical equations related
to the patterns obtained on diffraction of a
monochromatic beam of X-rays by the atoms (scattering
centers) of a tNS3/sNS4A complex in crystal form. The
diffraction data are used to calculate an electron
density map of the repeating unit of the crystal. The
electron density maps are then used to establish the
positions of the individual atoms of the tNS3/sNS4A
enzyme or enzyme complex.

Those of skill in the art will understand
that a set of structure coordinates for an enzyme or an
enzyme-complex or a portion thereof, is a relative set
of points that define a shape in three dimensions.
Thus, it is possible that an entirely different set of
coordinates could define a similar or identical shape.
Moreover, slight variations in the individual
coordinates will have little effect on overall shape.

The variations in coordinates discussed above
may be generated because of mathematical manipulations
of the structure coordinates. For example, the
structure coordinates set forth in Figure 3 could be
manipulated by crystallographic permutations of the
structure coordinates, fractionalization of the
structure coordinates, integer additions or
subtractions to sets of the structure coordinates,
inversion of the structure coordinates or any
combination of the above.
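
To make concrete why such manipulations leave the shape
unchanged, the following Python sketch compares the full set
of interatomic distances before and after a translation and
an inversion through the origin. The function names and the
sample coordinates are assumptions made for this example, and
the sketch checks distances only; it does not test handedness,
which inversion reverses.

```python
import math
from itertools import combinations


def pairwise_distances(coords):
    # All interatomic distances, in a fixed pair order.
    return [math.dist(a, b) for a, b in combinations(coords, 2)]


def translate(coords, shift):
    # Add the same vector to every atom (e.g. an integer cell shift).
    return [tuple(c + s for c, s in zip(p, shift)) for p in coords]


def invert(coords):
    # Inversion through the origin: (x, y, z) -> (-x, -y, -z).
    return [tuple(-c for c in p) for p in coords]


def same_shape(a, b, tol=1e-9):
    # Two coordinate sets with the same atom order describe the same
    # set of interatomic distances if every paired distance agrees.
    da, db = pairwise_distances(a), pairwise_distances(b)
    return len(da) == len(db) and all(abs(x - y) <= tol for x, y in zip(da, db))
```

Under these assumptions, a coordinate set that has been
translated and inverted still passes same_shape, whereas
moving a single atom relative to the others does not.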

Alternatively, modifications in the crystal
structure due to mutations, additions, substitutions,
and/or deletions of amino acids, or other changes in
any of the components that make up the crystal could
also account for variations in structure coordinates.
If such variations are within an acceptable standard
error as compared to the original coordinates, the
resulting three-dimensional shape is considered to be
the same.

Various computational analyses are therefore
necessary to determine whether a molecule or molecular
complex or a portion thereof is sufficiently similar to
all or parts of the NS3-like polypeptide/NS4A-like
peptide structure described above as to be considered
the same. Such analyses may be carried out in current
software applications, such as the Molecular Similarity
application of QUANTA (Molecular Simulations Inc., San
Diego, CA) version 4.1, and as described in the
accompanying User's Guide.

The Molecular Similarity application permits
comparisons between different structures, different
conformations of the same structure, and different
parts of the same structure. The procedure used in
Molecular Similarity to compare structures is divided
into four steps: 1) load the structures to be
compared; 2) define the atom equivalences in these
structures; 3) perform a fitting operation; and 4)
analyze the results.

Each structure is identified by a name. One
structure is identified as the target (i.e., the fixed
structure); all remaining structures are working
structures (i.e., moving structures). Since atom
equivalency within QUANTA is defined by user input, for
the purpose of this invention we will define equivalent
atoms as protein backbone atoms (N, Cα, C and O) for
all conserved residues between the two structures being
compared. We will also consider only rigid fitting
operations.

When a rigid fitting method is used, the
working structure is translated and rotated to obtain
an optimum fit with the target structure. The fitting
operation uses an algorithm that computes the optimum
translation and rotation to be applied to the moving
structure, such that the root mean square difference of
the fit over the specified pairs of equivalent atoms is
an absolute minimum. This number, given in angstroms,
is reported by QUANTA.
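
The text does not say which algorithm QUANTA uses for this
step. As a minimal sketch of a rigid fit of this kind, the
Python code below uses Horn's quaternion method, with the
dominant eigenvector found by shifted power iteration. The
function names, the start vector and the iteration count are
assumptions made for this example, and the power iteration may
converge slowly for nearly degenerate atom sets (e.g. collinear
atoms).

```python
import math


def centroid(points):
    n = len(points)
    return [sum(p[k] for p in points) / n for k in range(3)]


def rigid_fit(moving, target, iterations=1000):
    """Return (R, t) such that R*p + t maps the moving atoms onto the
    paired target atoms with the least root mean square difference."""
    mc, tc = centroid(moving), centroid(target)
    P = [[p[k] - mc[k] for k in range(3)] for p in moving]
    Q = [[q[k] - tc[k] for k in range(3)] for q in target]
    # Cross-covariance: S[a][b] = sum over atom pairs of P[a] * Q[b].
    S = [[sum(p[a] * q[b] for p, q in zip(P, Q)) for b in range(3)]
         for a in range(3)]
    (sxx, sxy, sxz), (syx, syy, syz), (szx, szy, szz) = S
    # Horn's symmetric 4x4 matrix; its largest eigenvector is the
    # unit quaternion of the optimal rotation.
    N = [
        [sxx + syy + szz, syz - szy, szx - sxz, sxy - syx],
        [syz - szy, sxx - syy - szz, sxy + syx, szx + sxz],
        [szx - sxz, sxy + syx, -sxx + syy - szz, syz + szy],
        [sxy - syx, szx + sxz, syz + szy, -sxx - syy + szz],
    ]
    # Adding `shift` to the diagonal makes every eigenvalue
    # non-negative, so power iteration converges to the largest one.
    shift = sum(abs(v) for row in N for v in row)
    q = [1.0, 0.0, 0.0, 0.0]  # identity rotation, kept if S is all zeros
    if shift > 0.0:
        q = [0.8, 0.4, 0.4, 0.2]  # arbitrary unit start vector
        for _ in range(iterations):
            q = [sum(N[r][c] * q[c] for c in range(4)) + shift * q[r]
                 for r in range(4)]
            norm = math.sqrt(sum(v * v for v in q))
            q = [v / norm for v in q]
    w, x, y, z = q
    R = [
        [w*w + x*x - y*y - z*z, 2*(x*y - w*z), 2*(x*z + w*y)],
        [2*(y*x + w*z), w*w - x*x + y*y - z*z, 2*(y*z - w*x)],
        [2*(z*x - w*y), 2*(z*y + w*x), w*w - x*x - y*y + z*z],
    ]
    t = [tc[r] - sum(R[r][c] * mc[c] for c in range(3)) for r in range(3)]
    return R, t


def apply_fit(points, R, t):
    # Rotate, then translate, every atom of the working structure.
    return [[sum(R[r][c] * p[c] for c in range(3)) + t[r] for r in range(3)]
            for p in points]


def rms_difference(a, b):
    # Root mean square difference over paired atoms, in the input units.
    return math.sqrt(sum(math.dist(p, q) ** 2 for p, q in zip(a, b)) / len(a))
```

In use, the equivalent backbone atoms of the working structure
would be passed as `moving` and those of the target as `target`,
in the same order; rms_difference(apply_fit(moving, R, t), target)
then gives the figure that is compared against a cutoff.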

For the purpose of this invention, any
molecule or molecular complex that has a root mean
square deviation of conserved residue backbone atoms
(N, Cα, C, O) of less than 1.5 Å when superimposed on
the relevant backbone atoms described by structure
coordinates listed in Figure 3 is considered
identical. More preferably, the root mean square
deviation is less than 1.0 Å.

The term "root mean square deviation" means
the square root of the arithmetic mean of the squares
of the deviations from the mean. It is a way to
express the deviation or variation from a trend or
object. For purposes of this invention, the "root mean
square deviation" defines the variation in the backbone
of a protein or protein complex from the relevant
portion of the backbone of the NS3-like polypeptide
portion of the complex as defined by the structure
coordinates described herein.
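
As a worked illustration of this measure, the Python sketch
below computes the root mean square deviation over paired
backbone atoms that have already been superimposed, and applies
the 1.5 Å and 1.0 Å cutoffs stated in this description. The
function names, the returned labels and the sample coordinates
are assumptions made for this example; the deviation is taken
between each atom and its equivalent atom in the reference
structure.

```python
import math


def backbone_rmsd(fitted, reference):
    """Root mean square deviation, in angstroms, between paired atoms
    of an already superimposed structure and the reference structure."""
    if len(fitted) != len(reference) or not fitted:
        raise ValueError("need two equal-length, non-empty atom lists")
    total = sum(math.dist(p, q) ** 2 for p, q in zip(fitted, reference))
    return math.sqrt(total / len(fitted))


def classify(rmsd, cutoff=1.5, preferred=1.0):
    # Cutoffs follow the description: below 1.5 A is considered
    # identical, below 1.0 A is the more preferred range.
    if rmsd < preferred:
        return "identical (preferred)"
    if rmsd < cutoff:
        return "identical"
    return "different"
```

For instance, two atoms each displaced by 1.2 Å from their
reference positions give a deviation of 1.2 Å, which falls
under the 1.5 Å cutoff but not under the 1.0 Å one.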

Once the structure coordinates of a protein
crystal have been determined they are useful in solving
the structures of other crystals.

Thus, in accordance with the present
invention, the structure coordinates of a NS3-like
polypeptide/NS4A-like peptide complex, and in
particular a tNS3/sNS4A complex, and portions thereof
are stored in a machine-readable storage medium. Such
data may be used for a variety of purposes, such as
drug discovery and x-ray crystallographic analysis of
protein crystals.

Accordingly, in one embodiment of this
invention is provided a machine-readable data storage
medium comprising a data storage material encoded with
the structure coordinates set forth in Figure 3.

Figure 4 demonstrates one version of these
embodiments. System 10 includes a computer 11
comprising a central processing unit ("CPU") 20, a
working memory 22 which may be, e.g., RAM (random-access
memory) or "core" memory, mass storage memory 24 (such
as one or more disk drives or CD-ROM drives), one or
more cathode-ray tube ("CRT") display terminals 26, one
or more keyboards 28, one or more input lines 30, and
one or more output lines 40, all of which are
interconnected by a conventional bidirectional system
bus 50.

Input hardware 36, coupled to computer 11 by
input lines 30, may be implemented in a variety of
ways. Machine-readable data of this invention may be
inputted via the use of a modem or modems 32 connected
by a telephone line or dedicated data line 34.
Alternatively or additionally, the input hardware 36
may comprise CD-ROM drives or disk drives 24. In
conjunction with display terminal 26, keyboard 28 may
also be used as an input device.

Output hardware 46, coupled to computer 11 by
output lines 40, may similarly be implemented by
conventional devices. By way of example, output
hardware 46 may include CRT display terminal 26 for
displaying a graphical representation of a binding
pocket of this invention using a program such as QUANTA
as described herein. Output hardware might also
include a printer 42, so that hard copy output may be
produced, or a disk drive 24, to store system output
for later use.

In operation, CPU 20 coordinates the use of
the various input and output devices 36, 46,
coordinates data accesses from mass storage 24 and
accesses to and from working memory 22, and determines
the sequence of data processing steps. A number of
programs may be used to process the machine-readable
data of this invention. Such programs are discussed in
reference to the computational methods of drug
discovery as described herein. Specific references to
components of the hardware system 10 are included as
appropriate throughout the following description of the
data storage medium.

Figure 5 shows a cross section of a magnetic
data storage medium 100 which can be encoded with
machine-readable data that can be carried out by a
system such as system 10 of Figure 4. Medium 100 can
be a conventional floppy diskette or hard disk, having
a suitable substrate 101, which may be conventional,
and a suitable coating 102, which may be conventional,
on one or both sides, containing magnetic domains (not
visible) whose polarity or orientation can be altered
magnetically. Medium 100 may also have an opening (not
shown) for receiving the spindle of a disk drive or
other data storage device 24.

The magnetic domains of coating 102 of medium
100 are polarized or oriented so as to encode, in a manner
which may be conventional, machine-readable data such
as that described herein, for execution by a system
such as system 10 of Figure 4.

Figure 6 shows a cross section of an
optically-readable data storage medium 110 which also
can be encoded with such machine-readable data, or
set of instructions, which can be carried out by a
system such as system 10 of Figure 4. Medium 110 can
be a conventional compact disk read only memory
(CD-ROM) or a rewritable medium such as a
magneto-optical disk which is optically readable and
magneto-optically writable. Medium 110 preferably has
a suitable substrate 111, which may be conventional,
and a suitable coating 112, which may be conventional,
usually on one side of substrate 111.

In the case of CD-ROM, as is well known,
coating 112 is reflective and is impressed with a
plurality of pits 113 to encode the machine-readable
data. The arrangement of pits is read by reflecting
laser light off the surface of coating 112. A
protective coating 114, which preferably is
substantially transparent, is provided on top of
coating 112.

In the case of a magneto-optical disk, as is
well known, coating 112 has no pits 113, but has a
plurality of magnetic domains whose polarity or
orientation can be changed magnetically when heated
above a certain temperature, as by a laser (not shown).
The orientation of the domains can be read by measuring
the polarization of laser light reflected from coating
112. The arrangement of the domains encodes the data
as described above.

For the first time, the present invention
permits the use of structure-based or rational drug
design techniques to design, select, and synthesize
chemical entities, including inhibitory compounds that
are capable of binding to HCV NS3, NS4A, NS3/NS4A
complex, or any portion thereof.

One particularly useful drug design technique
enabled by this invention is iterative drug design.
Iterative drug design is a method for optimizing
associations between a protein and a compound by
determining and evaluating the three-dimensional
structures of successive sets of protein/compound
complexes.

Those of skill in the art will realize that
association of natural ligands or substrates with the
binding pockets of their corresponding receptors or
enzymes is the basis of many biological mechanisms of
action. The term "binding pocket", as used herein,
refers to a region of a molecule or molecular complex,
that, as a result of its shape, favorably associates
with another chemical entity or compound. Similarly,
many drugs exert their biological effects through
association with the binding pockets of receptors and
enzymes. Such associations may occur with all or any
parts of the binding pockets. An understanding of such
associations will help lead to the design of drugs
having more favorable associations with their target
receptor or enzyme, and thus, improved biological
effects. Therefore, this information is valuable in
designing potential ligands or inhibitors of receptors
or enzymes, such as inhibitors of HCV NS3-like
polypeptides, and more importantly HCV NS3.

The term "associating with" refers to a
condition of proximity between chemical entities or
compounds, or portions thereof. The association may be
non-covalent, wherein the juxtaposition is
energetically favored by hydrogen bonding or van der
Waals or electrostatic interactions, or it may be
covalent.

In iterative drug design, crystals of a
series of protein/compound complexes are obtained and
then the three-dimensional structure of each complex
is solved. Such an approach provides insight into the
association between the proteins and compounds of each
complex. This is accomplished by selecting compounds
with inhibitory activity, obtaining crystals of this
new protein/compound complex, solving the three-
dimensional structure of the complex, and comparing the
associations between the new protein/compound complex
and previously solved protein/compound complexes. By
observing how changes in the compound affected the
protein/compound associations, these associations may
be optimized.

In some cases, iterative drug design is
carried out by forming successive protein/compound
complexes and then crystallizing each new complex.
Alternatively, a pre-formed protein crystal is soaked
in the presence of an inhibitor, thereby forming a
protein/compound complex and obviating the need to
crystallize each individual protein/compound complex.
Advantageously, the HCV NS3-like polypeptide/NS4A-like
peptide crystals, and in particular the tNS3/sNS4A
crystals, provided by this invention may be soaked in
the presence of a compound or compounds, such as NS3
protease inhibitors, to provide NS3-like
polypeptide/NS4A-like peptide/compound crystal
complexes.

As used herein, the term "soaked" refers to a
process in which the crystal is transferred to a
solution containing the compound of interest.

In another embodiment of this invention is
provided a method for preparing a composition
comprising a NS3-like polypeptide, comprising
the steps described in Examples 1 and 2. Preferably,
the composition comprises a NS3-like polypeptide in
complex with a NS4A-like peptide.

The structure coordinates set forth in Figure 
3 can also be used to aid in obtaining structural 
information about another crystallized molecule or 
molecular complex. This may be achieved by any of a 
number of well-known techniques, including ^molecular 
replacement. & 
The structure coordinates set forth in Figure
3 can also be used for determining at least a portion
of the three-dimensional structure of molecules or
molecular complexes which contain at least some
structurally similar features to HCV NS3. In
particular, structural information about another
crystallized molecule or molecular complex may be
obtained. This may be achieved by any of a number of
well-known techniques, including molecular replacement.
Therefore, in another embodiment this
invention provides a method of utilizing molecular
replacement to obtain structural information about a
crystallized molecule or molecular complex whose
structure is unknown comprising the steps of:

a) generating an X-ray diffraction pattern
from said crystallized molecule or molecular complex;
and

b) applying at least a portion of the
structure coordinates set forth in Figure 3 to the
X-ray diffraction pattern to generate a
three-dimensional electron density map of the molecule
or molecular complex whose structure is unknown.

Preferably, the crystallized molecule or
molecular complex comprises a NS3-like polypeptide and
a NS4A-like peptide. More preferably, the crystallized
molecule or molecular complex is obtained by soaking a
crystal of this invention in a solution.

By using molecular replacement, all or part
of the structure coordinates of the tNS3/sNS4A complex
provided by this invention (and set forth in Figure 3)
can be used to determine the structure of a
crystallized molecule or molecular complex whose
structure is unknown more quickly and efficiently than
attempting to determine such information ab initio.

Molecular replacement provides an accurate
estimation of the phases for an unknown structure.
Phases are a factor in equations used to solve crystal
structures that cannot be determined directly.
Obtaining accurate values for the phases, by methods
other than molecular replacement, is a time-consuming
process that involves iterative cycles of
approximations and refinements and greatly hinders the
solution of crystal structures. However, when the
crystal structure of a protein containing at least a
homologous portion has been solved, the phases from the
known structure provide a satisfactory estimate of the
phases for the unknown structure.
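
For reference, the role of the phases can be written out with the standard Fourier synthesis used in crystallography. This is a textbook relation, not a formula taken from this specification, and the symbols V, F and α are introduced here only for illustration. The electron density at fractional coordinates (x, y, z) in a unit cell of volume V is:

```latex
\rho(x,y,z) = \frac{1}{V} \sum_{h,k,l} \lvert F(hkl) \rvert \, e^{i\alpha(hkl)} \, e^{-2\pi i (hx + ky + lz)}
```

The amplitudes |F(hkl)| follow from the measured diffraction intensities, whereas the phases α(hkl) are not measured. Molecular replacement supplies an estimate of α(hkl) from the known, homologous structure.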

Thus, this method involves generating a
preliminary model of a molecule or molecular complex
whose structure coordinates are unknown, by orienting
and positioning the relevant portion of the tNS3/sNS4A
complex according to Figure 3 within the unit cell of
the crystal of the unknown molecule or molecular
complex so as best to account for the observed X-ray
diffraction pattern of the crystal of the molecule or
molecular complex whose structure is unknown. Phases
can then be calculated from this model and combined
with the observed X-ray diffraction pattern amplitudes
to generate an electron density map of the structure
whose coordinates are unknown. This, in turn, can be
subjected to any well-known model building and
structure refinement techniques to provide a final,
accurate structure of the unknown crystallized molecule
or molecular complex [E. Lattman, "Use of the Rotation
and Translation Functions", in Meth. Enzymol., 115, pp.
55-77 (1985); M. G. Rossmann, ed., "The Molecular
Replacement Method", Int. Sci. Rev. Ser., No. 13,
Gordon & Breach, New York (1972)].
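
The step of combining model phases with observed amplitudes can be illustrated with a deliberately simplified sketch. It assumes a one-dimensional "crystal" sampled on 32 grid points, point atoms, and a search model that has already been oriented and positioned; none of these toy values or function names come from this specification, and the rotation and translation searches themselves are not shown.

```python
import cmath
import math


def structure_factors(density):
    """Discrete Fourier transform of a sampled 1-D density (toy F(h))."""
    n = len(density)
    return [
        sum(density[x] * cmath.exp(2j * math.pi * h * x / n) for x in range(n))
        for h in range(n)
    ]


def fourier_synthesis(amplitudes, phases):
    """Rebuild a density map from amplitudes |F(h)| and phases alpha(h)."""
    n = len(amplitudes)
    density = []
    for x in range(n):
        total = sum(
            amplitudes[h] * cmath.exp(1j * phases[h])
            * cmath.exp(-2j * math.pi * h * x / n)
            for h in range(n)
        )
        density.append(total.real / n)
    return density


def place_atoms(n, atoms):
    """Return an n-point grid holding point atoms given as (position, weight)."""
    grid = [0.0] * n
    for position, weight in atoms:
        grid[position] = weight
    return grid


# Hypothetical toy data: the "unknown" crystal has three atoms; the search
# model (the known, homologous portion) contains two of them.
unknown = place_atoms(32, [(5, 8.0), (12, 6.0), (20, 7.0)])
model = place_atoms(32, [(5, 8.0), (12, 6.0)])

f_unknown = structure_factors(unknown)
observed_amplitudes = [abs(f) for f in f_unknown]   # measurable in practice
true_phases = [cmath.phase(f) for f in f_unknown]   # not measurable in practice
model_phases = [cmath.phase(f) for f in structure_factors(model)]

# Control: observed amplitudes with the true phases return the true density.
control_map = fourier_synthesis(observed_amplitudes, true_phases)

# Molecular-replacement style map: observed amplitudes with model phases.
density_map = fourier_synthesis(observed_amplitudes, model_phases)
```

In this sketch `density_map` plays the part of the preliminary electron density map, which would then be inspected and improved by model building and refinement.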
The structure of any portion of any
crystallized molecule or molecular complex that is
sufficiently homologous to any portion of the
tNS3/sNS4A complex can be solved by this method.

In a preferred embodiment, the method of
molecular replacement is utilized to obtain structural
information about a molecule or molecular complex,
wherein the complex comprises a NS3-like polypeptide.
Preferably the NS3-like polypeptide is tNS3 or a
homologue thereof.


The structure coordinates of tNS3/sNS4A as
provided by this invention are particularly useful in
solving the structure of other crystal forms of
NS3-like polypeptide, preferably other crystal forms of
tNS3; NS3-like polypeptide/NS4A-like peptide,
preferably tNS3/sNS4A; or complexes comprising any of
the above.

The structure coordinates are also
particularly useful to solve the structure of crystals
of NS3-like polypeptide/NS4A-like peptide complexes,
particularly tNS3/sNS4A, co-complexed with a variety of
chemical entities. This approach enables the
determination of the optimal sites for interaction
between chemical entities, including interaction of
candidate NS3 inhibitors with NS3 or the NS3/NS4A
complex. For example, high resolution X-ray
diffraction data collected from crystals exposed to
different types of solvent allows the determination of
where each type of solvent molecule resides. Small
molecules that bind tightly to these sites can then be
designed and synthesized and tested for their NS3
inhibition activity.
All of the complexes referred to above may be
studied using well-known X-ray diffraction techniques
and may be refined versus 1.5-3 Å resolution X-ray data
to an R value of about 0.20 or less using computer
software, such as X-PLOR [Yale University, ©1992,
distributed by Molecular Simulations, Inc.; see, e.g.,
Blundell & Johnson, supra; Meth. Enzymol., vol. 114 &
115, H. W. Wyckoff et al., eds., Academic Press
(1985)]. This information may thus be used to optimize
known NS3 inhibitors, and more importantly, to design
new NS3 inhibitors.
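
As a point of reference for the R value mentioned above, the conventional crystallographic definition is R = Σ| |Fo| - k|Fc| | / Σ|Fo|, summed over reflections. The following minimal sketch assumes plain lists of observed and calculated amplitudes and a single overall scale factor k = Σ|Fo| / Σ|Fc|. The function name and the scaling choice are illustrative assumptions and do not reproduce the X-PLOR implementation.

```python
def r_factor(f_obs, f_calc):
    """Conventional R value between observed and calculated amplitudes.

    Assumes one overall scale factor k = sum(Fo) / sum(Fc); refinement
    programs may use more elaborate scaling schemes.
    """
    if len(f_obs) != len(f_calc) or not f_obs:
        raise ValueError("need two non-empty lists of equal length")
    scale = sum(f_obs) / sum(f_calc)
    numerator = sum(abs(fo - scale * fc) for fo, fc in zip(f_obs, f_calc))
    return numerator / sum(f_obs)


# Hypothetical amplitudes, for illustration only.
example_r = r_factor([100.0, 200.0], [110.0, 190.0])
```

An R value of 0.20 on this definition means that the scaled calculated amplitudes differ from the observed ones by 20% of the summed observed amplitude.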

In order that this invention be more fully
understood, the following examples are set forth.
These examples are for illustrative purposes only
and are not to be construed as limiting the scope of
this invention in any way.

EXAMPLE 1 
Expression and Purification of tNS3 
The truncated NS3 serine protease domain
(tNS3) was cloned from a cDNA of the hepatitis C virus
H strain [Grakoui, A. et al., "Expression and
Identification of Hepatitis C Virus Polyprotein
Cleavage Products", J. Virol., 67, pp. 1385-1395
(1993)]. The first 181 amino acids of NS3 (residues
1027-1207 of the viral polyprotein) have been shown to
contain the serine protease domain of NS3 that
processes all four downstream sites of the HCV
polyprotein [Lin, C. et al., "Hepatitis C Virus NS3
Serine Proteinase: Trans-Cleavage Requirements and
Processing Kinetics", J. Virol., 68, pp. 8147-8157
(1994b)], so we expressed a (His)₆-fusion protein based
on this tNS3. The plasmid pET-BS(+)/HCV/T7-NS3₁₈₁-His
was derived from pTM3/HCV/1027-1207 (NS3₁₈₁) (Id.), by
using polymerase chain reaction to introduce epitope
tags and new restriction sites. A T7-tag (ASMTGGQQMG),
from the N-terminus of the gene 10 protein of the T7
bacteriophage
[Tsai, D.E. et al., "In Vitro Selection of an RNA
Epitope Immunologically Cross-Reactive with a Peptide",
Proc. Natl. Acad. Sci. USA, 89, pp. 8864-8868 (1992)],
was placed at the N-terminus of the tNS3 domain. Two
linker residues (GS) were placed at the tNS3
C-terminus, followed by the (His)₆-tag. E. coli
JM109(DE3) cells, freshly transformed with the
pET-BS(+)/HCV/T7-NS3₁₈₁-His plasmid, were grown at 37 °C in
complex media supplemented with 100 µg/ml ampicillin, in
a 10 L fermentor (Braun). When the cell density
reached an OD600 of 3-4 the temperature of the culture
was rapidly reduced to 30 °C, and induction was
immediately initiated by the addition of 1 mM IPTG.
Cells were harvested at 2 h post-induction, and flash
frozen at -70 °C prior to purification.

The tNS3 was purified from the soluble
fraction of the recombinant E. coli lysates as follows,
with all procedures being performed at 4 °C unless
stated otherwise. Cell paste (75-100 g) was resuspended
in 15 volumes of 50 mM HEPES, 0.3 M NaCl, 10% glycerol,
0.1% β-octyl glucoside, 2 mM β-mercaptoethanol, pH 8.0.
Cells were ruptured using a microfluidizer and the
homogenate was clarified by centrifugation at 100,000 x
g for 30 min. The supernatant was brought to 50 mM
HEPES, 20 mM imidazole, 0.3 M NaCl, 27.5% glycerol,
0.1% β-octyl glucoside, 2 mM β-mercaptoethanol, pH
8.10, and applied at 1.0 ml/min to a 7.0 ml Ni-Agarose
affinity column, equilibrated in the same buffer.
After loading, the column was washed with 10-15 volumes
of equilibration buffer and the bound proteins were
eluted with equilibration buffer containing 0.35 M
imidazole. The protein was then size-fractionated on
two columns in series (each 2.6 cm x 90 cm) packed with
Pharmacia high resolution S100 resin and equilibrated
with 25 mM HEPES, 0.3 M NaCl, 10% glycerol, 0.1%
β-octylglucoside, 2 mM β-mercaptoethanol, pH 8.0. The
tNS3 fractions, identified by SDS-PAGE, were pooled and
concentrated to 1 mg/ml using an Amicon Centriprep-10,
and stored at -70 °C. The tNS3 was thawed slowly on ice
and the NS4A peptide (dissolved in the size-exclusion
chromatography buffer) was added at a tNS3:NS4A-peptide
molar ratio of 1:2. The sample was then diluted
2.5-fold with 15 mM MES, 0.5 M NaCl, 20 mM
β-mercaptoethanol, pH 6.5, and concentrated to ~2 ml (~2
mg/ml) by ultrafiltration. The sample was then diluted
2-fold with the pH 6.5 buffer and concentrated again to
~2 ml. This dilution process was repeated until it
gave a >40-fold dilution of the original buffer
constituents. The protein sample was then concentrated
to 13.0 mg/ml and centrifuged at ~300,000 x g for 20
min at 4 °C. Concentrations of the pure tNS3 and
tNS3/4A complex were determined by UV absorption
spectroscopy, using a molar absorption coefficient
(A280) of 17,700 M⁻¹ cm⁻¹.
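
The concentration determination described above follows the Beer-Lambert relation A = ε·c·l. The sketch below assumes a 1 cm path length by default; the molecular weight argument is hypothetical, because this example does not state the molecular weight of the tNS3 construct or of the complex, and the function names are illustrative only.

```python
EPSILON_280 = 17700.0  # molar absorption coefficient from the text, M^-1 cm^-1


def molar_concentration(a280, path_cm=1.0, epsilon=EPSILON_280):
    """Beer-Lambert law: concentration (mol/L) = A / (epsilon * path length)."""
    return a280 / (epsilon * path_cm)


def mg_per_ml(a280, mol_weight_da, path_cm=1.0):
    """Convert an A280 reading to mg/ml (mol/L times g/mol equals g/L)."""
    return molar_concentration(a280, path_cm) * mol_weight_da


# Hypothetical reading and hypothetical molecular weight, for illustration.
example_molar = molar_concentration(0.177)
example_mg_ml = mg_per_ml(0.177, 20000.0)
```

A concentrated sample would normally be diluted into the linear range of the spectrophotometer before reading, and the result multiplied by the dilution factor.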
EXAMPLE 2

4A Peptide Synthesis and Purification
The HCV NS4A peptide was synthesized to span
residues Gly21 to Pro39 of the viral cofactor (residues
1678 to 1696 of the HCV polyprotein), which
incorporates the region reported to be
essential for NS3 stimulation [Lin, C. et al. "A
Central Region in the Hepatitis C Virus NS4A Protein
Allows Formation of an Active NS3-NS4A Serine
Proteinase Complex In Vivo and In Vitro", J. Virol. 69,
pp. 4373-4380 (1995)]. Lysine residues were added to
the termini to assist aqueous solubility, and a serine
residue was substituted for Cys22 (residue 1679 of the
polyprotein of the HCV H strain). The peptide
(H-KKGSVVIVGRIVLSGKPAIIPKK-OH • TFA salt) was prepared by
solid-phase peptide synthesis (Applied Biosystems
433A) beginning with Nα-Fmoc, Nε-Boc-Lys Wang resin.
Nα-Fmoc-protected amino acids were added sequentially
using HBTU (2-(1H-benzotriazol-1-yl)-1,1,3,3-
tetramethyluronium hexafluorophosphate) with HOBt
(1-hydroxybenzotriazole hydrate) as coupling agents in
N-methylpyrrolidinone. Cleavage from the resin and
global deprotection were accomplished with 95%
trifluoroacetic acid and 5% water at room temperature
for 1.5 hr (15 ml/g resin). The peptide was purified
by preparative HPLC on a Waters Delta Pak C18, 15 µ,
300 Å column (30 mm x 300 mm) eluting with a linear
gradient of acetonitrile (15-40%) in 0.1% aqueous
trifluoroacetic acid over 35 min (flow rate of 22
ml/min). Peptide purity was confirmed by analytical
HPLC. The sequence was confirmed by direct N-terminal
sequence analysis and matrix-assisted laser desorption
mass spectrometry (Kratos MALDI I), which showed the
correct (M + H)+ and (M + Na)+ molecular ions.
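
The expected positions of the (M + H)+ and (M + Na)+ ions can be estimated from the peptide sequence given above. The sketch below uses standard monoisotopic residue masses and treats the peptide as the free acid with unmodified termini (the TFA counter-ion is not part of the ion). Whether the instrument reported monoisotopic or average masses is not stated in this example, so the choice of monoisotopic values is an assumption, as are the function names.

```python
# Standard monoisotopic residue masses (Da) for the residues in the peptide.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "L": 113.08406, "I": 113.08406,
    "K": 128.09496, "R": 156.10111,
}
WATER = 18.01056       # H2O accounting for the free N- and C-termini
PROTON = 1.00728       # mass of H+
SODIUM_ION = 22.98922  # mass of Na+

PEPTIDE = "KKGSVVIVGRIVLSGKPAIIPKK"  # H-...-OH sequence from the text


def neutral_mass(sequence):
    """Monoisotopic mass of the neutral, unmodified peptide."""
    return sum(RESIDUE_MASS[residue] for residue in sequence) + WATER


def singly_charged_mz(sequence, adduct_mass):
    """m/z of a singly charged ion formed by adding one cation."""
    return neutral_mass(sequence) + adduct_mass


m_plus_h = singly_charged_mz(PEPTIDE, PROTON)
m_plus_na = singly_charged_mz(PEPTIDE, SODIUM_ION)
```

The two ions are expected to differ by the mass difference between Na+ and H+, roughly 22 Da, which is a convenient internal check when assigning the spectrum.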
EXAMPLE 3
Crystallization and Data Collection
Crystals of the tNS3/NS4A complex were grown
by hanging-drop vapor diffusion over a reservoir of 0.1
M MES, 1.8 M NaCl, 0.1 M sodium/potassium phosphate, 10
mM β-mercaptoethanol, pH 6.5. The crystals grew over
the course of 2-3 weeks, to final dimensions of about
0.1 x 0.1 x 0.25 mm. The rhombohedral crystals used in
this study belonged to space group R32, with unit cell
dimensions a=b=225.0 Å, and c=75.5 Å, and contained two
tNS3/NS4A complexes per asymmetric unit.
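
The cell dimensions above allow a quick estimate of the unit cell volume and, given a molecular weight, of the crystal packing density (Matthews coefficient). The sketch below assumes that the quoted a, b and c refer to the hexagonal setting of R32 (α = β = 90°, γ = 120°), in which the cell contains 18 asymmetric units. The molecular weight passed in the usage lines is a hypothetical placeholder, since this example does not state the weight of the complex.

```python
import math


def cell_volume(a, b, c, alpha=90.0, beta=90.0, gamma=90.0):
    """Volume of a general unit cell (lengths in Å, angles in degrees)."""
    ca, cb, cg = (math.cos(math.radians(x)) for x in (alpha, beta, gamma))
    return a * b * c * math.sqrt(1.0 - ca * ca - cb * cb - cg * cg + 2.0 * ca * cb * cg)


def matthews_coefficient(volume, mol_weight_da, copies_per_asu, asu_per_cell):
    """Crystal volume per unit of protein mass, in Å^3 per Da."""
    return volume / (mol_weight_da * copies_per_asu * asu_per_cell)


# Cell from the text, assumed to be in the hexagonal setting.
volume = cell_volume(225.0, 225.0, 75.5, gamma=120.0)

# Two complexes per asymmetric unit (from the text), 18 asymmetric units per
# hexagonal R32 cell, and a hypothetical 21,000 Da per complex.
vm = matthews_coefficient(volume, 21000.0, copies_per_asu=2, asu_per_cell=18)
```

Replacing the placeholder weight with the actual mass of the tNS3/NS4A complex would give the packing density for these crystals.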
Statistics for data collection, heavy atom
refinement, and crystallographic refinement are given
in Table 1. All heavy atom soaks were done in
hanging-drops over the same reservoir as used for
crystallization. Crystals were transferred to a
stabilizing solution (50 mM MES, 2.0 M NaCl, 0.1 M
sodium/potassium phosphate, 10 mM β-mercaptoethanol,
and 20% glycerol, pH 6.2) and then frozen in a dry
nitrogen gas stream at 100 K (Molecular Structure
Corp., Houston, TX) for data collection. Data were
acquired by oscillation photography on a Rigaku R-AXIS
IIC phosphor imaging area detector mounted on a Rigaku
RU200 rotating anode generator (MSC), operating at 50 kV
and 100 mA. Measured intensities were integrated,
scaled, and merged using the HKL software package (Z.
Otwinowski and W. Minor).
EXAMPLE 4

Phasing, Model Building and Refinement

Heavy atom positions were located by
inspection and confirmed with difference Fourier
syntheses. Heavy atom parameters were refined and
phases computed to 3.1 Å using the program PHASES
[Furey, W. and Swaminathan, S. "PHASES-95: a program
package for the processing and analysis of diffraction
data from macromolecules", Meth. Enzymol., (1996)]. MIR
phases were improved and extended to 2.7 Å by cycles of
solvent flattening [Wang, B.C., "Resolution of Phase
Ambiguity in Macromolecular Crystallography", Methods
in Enzymol. 115, pp. 90-112 (1985)] combined with
histogram matching [Zhang, K.Y.J. and Main, P., "The
Use of Sayre's Equation With Solvent Flattening and
Histogram Matching for Phase Extension and Refinement
of Protein Structures", Acta Crystallogr., A46, pp.
377-381 (1990)] using the CCP4 crystallographic package
(Collaborative Computation Project, 1994). The
resulting electron density map displayed nearly
continuous density for the protein backbone as well as
strong side chain density. Approximately 80% of the
model could be unambiguously built into this map
(QUANTA 4.1, Molecular Simulations), and a single round
of simulated annealing refinement in X-PLOR [Brunger,
A. T., "X-PLOR: A System for X-Ray Crystallography and
NMR", New Haven, Connecticut: Department of Molecular
Biophysics and Biochemistry, Yale University (1993)]
brought the R-factor to 29% and free R value to 33%
[Brunger, A. T., "Free R Value: A Novel Statistical
Quantity for Assessing the Accuracy of Crystal
Structures", Nature, 355, pp. 472-475 (1992)]. The
remainder of the model was built and refined in several
steps, by first extending the resolution to 2.5 Å and
then adding well-ordered water molecules. A final
round of positional and individual temperature factor
refinement brought the R-factor to 21.6% (free R value
26.1%) for 26,652 reflections between 6.0 and 2.5 Å
(F>1σF). The current model consisted of tNS3 residues
1055-1206 and NS4A residues 1678-1693 in complex A, and
tNS3 residues 1028-1206 and NS4A residues 1678-1696 for
complex B (polyprotein numbering), with 2 zinc atoms
and 130 water molecules. A Ramachandran plot for the
final model contained [illegible]% of the residues in the most
favored regions and [illegible] in disallowed or generously-
allowed regions. The rms deviations from ideality were
0.007 Å for bond lengths and 1.47° for bond angles.
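
A Ramachandran plot places each residue according to its backbone torsion angles φ (C of the preceding residue, N, CA, C) and ψ (N, CA, C, N of the following residue). The sketch below shows one common way of computing a signed torsion angle from four atomic positions such as those listed in Figure 3. The function and helper names are illustrative, and the sketch is not the procedure of any particular validation program.

```python
import math


def _sub(p, q):
    return (p[0] - q[0], p[1] - q[1], p[2] - q[2])


def _cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def _dot(u, v):
    return u[0] * v[0] + u[1] * v[1] + u[2] * v[2]


def dihedral(p1, p2, p3, p4):
    """Signed torsion angle p1-p2-p3-p4 in degrees (range -180 to 180)."""
    b1, b2, b3 = _sub(p2, p1), _sub(p3, p2), _sub(p4, p3)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)   # normals of the two planes
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))


# Idealised test geometry (not coordinates from Figure 3).
cis = dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 0, 1))
trans = dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (-1, 0, 1))
quarter = dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1))
```

For φ the four points would be C(i-1), N(i), CA(i), C(i), and for ψ they would be N(i), CA(i), C(i), N(i+1).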
While we have described a number of
embodiments of this invention, it is apparent that our
basic examples may be altered to provide other
embodiments which utilize the products and processes of
this invention. Therefore, it will be appreciated that
the scope of this invention is to be defined by the
appended claims rather than by the specific embodiments
which have been represented by way of example.
CLAIMS

We claim:

1. A composition comprising a HCV NS3-like
polypeptide complexed with a NS4A-like peptide.

2. The composition according to claim 1,
wherein the HCV NS3-like polypeptide is a NS3 protease
domain polypeptide or a NS3 protease domain-like
polypeptide.

3. The composition according to claim 1,
wherein the HCV NS3-like polypeptide is tNS3.

4. The composition according to any one of
claims 1 to 3, wherein the NS4A-like peptide is
H-KKGSVVIVGRIVLSGKPAIIPKK-OH.

5. A crystal comprising a HCV NS3-like
polypeptide complexed with a NS4A-like peptide.

6. The crystal according to claim 5, wherein
the HCV NS3-like polypeptide is a NS3 protease domain
polypeptide or a NS3 protease domain-like polypeptide.

7. The crystal according to claim 5, wherein
the HCV NS3-like polypeptide is tNS3.

8. The crystal according to any one of
claims 1 to 3, wherein the NS4A-like peptide is
H-KKGSVVIVGRIVLSGKPAIIPKK-OH.
9. The crystal according to claim 5,
additionally comprising an inhibitor of HCV NS3.

10. A machine-readable data storage medium,
comprising a data storage material encoded with machine
readable data, wherein the data is defined by the
structure coordinates of a tNS3/sNS4A complex according
to Figure 3, or a homologue of said complex, wherein
said homologue comprises backbone atoms that have a
root mean square deviation from the backbone atoms of
the complex of not more than 1.5 Å.

11. The machine-readable data storage
medium, according to claim 10, wherein said molecule or
molecular complex is defined by the set of structure
coordinates for tNS3/sNS4A according to Figure 3, or a
homologue of said molecule or molecular complex, said
homologue having a root mean square deviation from the
backbone atoms of said amino acids of not more than 1.5
Å.

12. A machine-readable data storage medium
comprising a data storage material encoded with a first
set of machine readable data comprising a Fourier
transform of at least a portion of the structural
coordinates for tNS3/sNS4A according to Figure 3;
which, when combined with a second set of machine
readable data comprising an X-ray diffraction pattern
of a molecule or molecular complex of unknown
structure, using a machine programmed with instructions
for using said first set of data and said second set of
data, can determine at least a portion of the structure
coordinates corresponding to the second set of machine
readable data, said first set of data and said second
set of data.

13. A method of obtaining structural
information about a molecule or a molecular complex of
unknown structure by using the structure coordinates
set forth in Figure 3, comprising the steps of:

a. generating X-ray diffraction data from
said crystallized molecule or molecular complex;
b. applying at least a portion of the
structure coordinates set forth in Figure 3 to said
X-ray diffraction pattern to generate a
three-dimensional electron density map of at least a
portion of the molecule or molecular complex.

14. The method according to claim 13,
wherein the molecule or molecular complex of unknown
structure comprises a polypeptide selected from a
NS3-like polypeptide in complex with a NS4A-like peptide.
tNS3 COORDINATES (Complex A)
70.636 1.00 31.88
ATOM  1615  N    PRO  1096    102.973  66.668  30.959  1.00  24.82
ATOM  1616  CD   PRO  1096    104.071  67.640  31.038  1.00  24.20
ATOM  1617  CA   PRO  1096    101.720  67.258  31.419  1.00  24.35
ATOM  1618  CB   PRO  1096    102.163  68.611  31.970  1.00  23.18
ATOM  1619  CG   PRO  1096    103.321  68.948  31.117  1.00  25.11
ATOM  1620  C    PRO  1096    100.732  67.401  30.268  1.00  24.48
ATOM  1621  O    PRO  1096    101.125  67.569  29.116  1.00  25.88
ATOM  1622  N    VAL  1097     99.450  67.270  30.582  1.00  25.45
ATOM  1623  CA   VAL  1097     98.393  67.386  29.595  1.00  24.09
ATOM  1624  CB   VAL  1097     97.598  66.071  29.520  1.00  22.86
ATOM  1625  CG1  VAL  1097     96.393  66.224  28.609  1.00  21.59
ATOM  1626  CG2  VAL  1097     98.510  64.96?  29.009  1.00  23.88
ATOM  1627  C    VAL  1097     97.495  68.563  29.973  1.00  24.81
ATOM  1628  O    VAL  1097     97.051  68.675  31.113  1.00  25.35
ATOM  1629  N    ILE  1098     97.298  69.47?  29.032  1.00  25.32
ATOM  1630  CA   ILE  1098     96.474  70.666  29.252  1.00  25.37
ATOM  1631  CB   ILE  1098     96.731  71.69?  28.135  1.00  27.24
ATOM  1632  CG2  ILE  1098     96.029  73.015  28.447  1.00  26.41
ATOM  1633  CG1  ILE  1098     98.247  71.932  28.008  1.00  30.74
ATOM  1634  CD1  ILE  1098     98.659  72.943  26.946  1.00  32.59
ATOM  1635  C    ILE  1098     94.986  70.313  29.331  1.00  23.54
ATOM  1636  O    ILE  1098     94.546  69.325  28.752  1.00  24.20
ATOM  1637  N    GLN  1099     94.212  71.120  30.043  1.00  20.79
ATOM  1638  CA   GLN  1099     92.787  70.853  30.193  1.00  20.64
ATOM  1639  CB   GLN  1099     92.188  71.774  31.236  1.00  16.40
ATOM  1640  CG   GLN  1099     92.897  71.747  32.534  1.00  17.16
ATOM  1641  CD   GLN  1099     92.215  72.607  33.541  1.00  20.14
ATOM  1642  OE1  GLN  1099     91.129  73.118  33.286  1.00  25.87
ATOM  1643  NE2  GLN  1099     92.827  72.767  34.704  1.00  21.32
ATOM  1644  C    GLN  1099     91.973  70.990  28.918  1.00  22.84
ATOM  1645  O    GLN  1099     92.217  71.882  28.109  1.00  25.86
ATOM  1646  N    MET  1100     91.006  70.090  28.752  1.00  23.50
ATOM  1647  CA   MET  1100     90.099  70.094  27.606  1.00  22.12
ATOM  1648  CB   MET  1100     89.418  68.737  27.447  1.00  20.94
ATOM  1649  CG   MET  1100     90.052  67.816  26.450  1.00  22.51
ATOM  1650  SD   MET  1100     88.916  66.522  25.981  1.00  22.15
ATOM  1651  CE   MET  1100     88.786  65.731  27.470  1.00  23.27
ATOM  1652  C    MET  1100     89.002  71.124  27.819  1.00  23.05
ATOM  1653  O    MET  1100     88.581  71.801  26.879  1.00  24.71
ATOM  1654  N    TYR  1101     88.495  71.180  29.047  1.00  22.24
ATOM  1655  CA   TYR  1101     87.424  72.098  29.402  1.00  20.19
ATOM  1656  CB   TYR  1101     86.103  71.340  29.561  1.00  19.75
ATOM  1657  CG   TYR  1101     85.809  70.388  28.431  1.00  21.76
ATOM  1658  CD1  TYR  1101     86.004  69.022  28.582  1.00  20.14
ATOM  1659  CE1  TYR  1101     ??.???  68.146  27.525  1.00  22.53
ATOM  1660  CD2  TYR  1101     85.383  70.859  27.195  1.00  22.07
ATOM  1661  CE2  TYR  1101     85.161  69.998  26.137  1.00  21.98
ATOM  1662  CZ   TYR  1101     85.363  68.642  26.304  1.00  22.76
ATOM  1663  OH   TYR  1101     85.130  67.781  25.254  1.00  24.74
ATOM  1664  C    TYR  1101     87.750  72.764  30.716  1.00  21.21
ATOM  1665  O    TYR  1101     88.437  72.187  31.560  1.00  22.22
ATOM  1666  N    THR  1102     87.236  73.973  30.890  1.00  23.24
ATOM  1667  CA   THR  1102     87.424  74.741  32.111  1.00  24.37
ATOM  1668  CB   THR  1102     88.640  75.681  32.013  1.00  24.53
ATOM  1669  OG1  THR  1102     89.738  74.991  31.399  1.00  23.57
ATOM  1670  CG2  THR  1102     89.047  76.169  33.403  1.00  22.58
ATOM  1671  C    THR  1102     86.175  75.597  32.244  1.00  25.86
ATOM  1672  O    THR  1102     85.658  76.094  31.246  1.00  28.08
ATOM  1673  N    ASN  1103     85.668  75.740  33.460  1.00  26.20
ATOM  1674  CA   ASN  1103     84.480  76.547  33.700  1.00  25.78
ATOM  1675  CB   ASN  1103     83.213  75.752  33.372  1.00  26.82
ATOM  1676  CG   ASN  1103     81.947  76.585  33.511  1.00  28.67
ATOM  1677  OD1  ASN  1103     81.932  77.609  34.199  1.00  30.76
ATOM  1678  ND2  ASN  1103     80.871  76.135  32.882  1.00  27.57
ATOM  1679  C    ASN  1103     84.480  76.995  35.156  1.00  26.25
ATOM  1680  O    ASN  1103     83.832  76.391  36.006  1.00  27.25
ATOM  1681  N    VAL  1104     85.212  78.069  35.430  1.00  27.39
ATOM  1682  CA   VAL  1104     85.333  78.611  36.776  1.00  28.21
ATOM  1683  CB   VAL  1104     86.025  79.980  36.783  1.00  27.51
ATOM  1684  CG1  VAL  1104     86.316  80.397  38.205  1.00  28.55
ATOM  1685  CG2  VAL  1104     87.298  79.948  35.963  1.00  28.06
ATOM  1686  C    VAL  1104     83.978  78.796  37.419  1.00  30.54
ATOM  1687  O    VAL  1104     83.803  78.517  38.604  1.00  31.53
ATOM  1688  N    ASP  1105     83.022  79.260  36.626  1.00  33.66
ATOM  1689  CA   ASP  1105     81.677  79.503  37.117  1.00  36.82
ATOM  1690  CB   ASP  1105     80.788  80.061  36.001  1.00  41.34
ATOM  1691  CG   ASP  1105     81.192  81.457  35.573  1.00  45.36
ATOM  1692  OD1  ASP  1105     81.157  82.369  36.429  1.00  47.53
ATOM  1693  OD2  ASP  1105     81.543  81.636  34.382  1.00  48.54
ATOM  1694  C    ASP  1105     81.058  78.241  37.676  1.00  36.71
ATOM  1695  O    ASP  1105     80.645  78.200  38.835  1.00  36.88
ATOM  1696  N    GLN  1106     81.011  77.203  36.851  1.00  36.71
ATOM  1697  CA   GLN  1106     80.417  75.940  37.266  1.00  36.44
ATOM  1698  CB   GLN  1106     79.895  75.194  36.042  1.00  38.58
ATOM  1699  CG   GLN  1106     78.821  76.012  35.312  1.00  43.02
ATOM  1700  CD   GLN  1106     78.192  75.305  34.120  1.00  44.86
ATOM  1701  OE1  GLN  1106     77.334  75.870  33.437  1.00  44.11
ATOM  1702  NE2  GLN  1106     78.606  74.068  33.866  1.00  47.22
ATOM  1703  C    GLN  1106     81.342  75.077  38.122  1.00  34.06
ATOM  1704  O    GLN  1106     80.930  74.029  38.625  1.00  33.78
ATOM  1705  N    ASP  1107     82.561  75.575  38.339  1.00  29.82
ATOM  1706  CA   ASP  1107     83.577  74.900  39.144  1.00  26.00
ATOM  1707  CB   ASP  1107     83.039  74.635  40.551  1.00  26.29
ATOM  1708  CG   ASP  1107     84.138  74.436  41.574  1.00  25.28
ATOM  1709  OD1  ASP  1107     85.287  74.822  41.303  1.00  26.49
ATOM  1710  OD2  ASP  1107     53.844  73.914  42.666  1.00  25.48
ATOM  1711  C    ASP  1107     84.016  73.597  38.492  1.00  23.82
ATOM  1712  O    ASP  1107     84.348  72.637  39.176  1.00  23.57
ATOM  1713  N    LEU  1108     84.066  73.593  37.167  1.00  21.98
ATOM  1714  CA   LEU  1108     84.440  72.413  36.403  1.00  22.63
ATOM  1715  CB   LEU  1108     83.349  72.083  35.378  1.00  21.22
ATOM  1716  CG   LEU  1108     82.093  71.415  35.928  1.00  21.50
ATOM  1717  CD1  LEU  1108     81.053  71.249  34.840  1.00  21.03
ATOM  1718  CD2  LEU  1108     82.473  70.079  36.506  1.00  20.86
ATOM  1719  C    LEU  1108     85.781  72.524  35.686  1.00  22.72
ATOM  1720  O    LEU  1108     86.206  73.608  35.285  1.00  24.88
ATOM  1721  N    VAL  1109     86.413  71.375  35.495  1.00  20.91
ATOM  1722  CA   VAL  1109     87.695  71.260  34.820  1.00  20.17
ATOM  1723  CB   VAL  1109     88.855  71.399  35.833  1.00  21.15
ATOM  1724  CG1  VAL  1109     90.117  70.7?8  35.292  1.00  22.22
ATOM  1725  CG2  VAL  1109     89.092  72.858  36.156  1.00  22.70
ATOM  1726  C    VAL  1109     87.674  69.850  34.245  1.00  19.05
ATOM  1727  O    VAL  1109     87.065  68.967  34.839  1.00  19.81
ATOM  1728  N    GLY  1110     88.306  69.648  33.092  1.00  17.18
ATOM  1729  CA   GLY  1110     88.328  68.334  32.477  1.00  16.06
ATOM  1730  C    GLY  1110     89.518  68.1??  31.558  1.00  17.59
ATOM  1731  O    GLY  1110     89.896  69.050  30.823  1.00  16.16
ATOM  1732  N    TRP  1111     90.139  66.957  31.650  1.00  17.19
ATOM  1733  CA   TRP  1111     91.299  66.569  30.835  1.00  16.21
ATOM  1734  CB   TRP  1111     92.498  66.181  31.721  1.00  14.75
ATOM  1735  CG   TRP  1111     93.039  67.225  32.642  1.00  11.64
ATOM  1736  CD2  TRP  1111     92.802  67.337  34.058  1.00  11.04
ATOM  1737  CE2  TRP  1111     93.577  68.419  34.525  1.00  11.37
ATOM  1738  CE3  TRP  1111     92.011  66.626  34.973  1.00  13.68
ATOM  1739  CD1  TRP  1111     93.916  68.213  32.321  1.00  11.69
ATOM  1740  NE1  TRP  1111     94.247  68.935  33.447  1.00  14.19
ATOM  1741  CZ2  TRP  1111     93.589  68.809  35.867  1.00  11.96
ATOM  1742  CZ3  TRP  1111     92.022  67.011  36.308  1.00  11.51
ATOM  1743  CH2  TRP  1111     92.808  68.095  36.741  1.00  13.13
ATOM  1744  C    TRP  1111     90.873  65.290  30.125  1.00  17.82
ATOM  1745  O    TRP  1111     89.882  64.670  30.516  1.00  20.28
ATOM  1746  N    PRO  1112     91.581  64.885  29.053  1.00  18.22
ATOM  1747  CD   PRO  1112     92.711  65.493  28.331  1.00  17.52
ATOM  1748  CA   PRO  1112     91.153  63.637  28.406  1.00  16.49
ATOM  1749  CB   PRO  1112     92.189  63.452  27.309  1.00  15.54
ATOM  1750  CG   PRO  1112     92.586  64.846  26.978  1.00  18.88
ATOM  1751  C    PRO  1112     91.318  62.567  29.477  1.00  18.48
ATOM  1752  O    PRO  1112     92.207  62.679  30.322  1.00  21.84
ATOM  1753  N    ALA  1113     90.443  61.573  29.512  1.00  19.42
ATOM  1754  CA   ALA  1113     90.572  60.540  30.526  1.00  18.50
ATOM  1755  CB   ALA  1113     89.441  59.543  30.420  1.00  17.46
ATOM  1756  C    ALA  1113     91.909  59.851  30.319  1.00  20.35
ATOM  1757  O    ALA  1113     92.278  59.532  29.189  1.00  21.19
ATOM  1758  N    PRO  1114     92.689  59.681  31.396  1.00  21.47
ATOM  1759  CD   PRO  1114     92.488  60.128  32.783  1.00  22.45
ATOM  1760  CA   PRO  1114     93.984  59.022  31.250  1.00  22.96


ATOM i1 761 


CB 


PRO 


1114 


91.529 


59.018 


32.682 


1.00 


23.65 
ATOM 1762 CG PRO 1114 93.306 59.138
ATOM 1763 C PRO 1114 93.835 57.620
ATOM 1764 O PRO 1114 92.888 56.903
ATOM 1765 N GLN 1115 94.730 57.254
ATOM 1766 CA GLN 1115 94.706 55.925
ATOM 1767 CB GLN 1115 95.903 55.744
ATOM 1768 CG GLN 1115 95.895 56.656
ATOM 1769 CD GLN 1115 94.865 56.236
ATOM 1770 OE1 GLN 1115 94.731 55.051
ATOM 1771 NE2 GLN 1115 94.122 57.203
ATOM 1772 C GLN 1115 94.762 54.895
ATOM 1773 O GLN 1115 95.548 55.039
ATOM 1774 N GLY 1116 93.881 53.902
ATOM 1775 CA GLY 1116 93.842 52.862
ATOM 1776 C GLY 1116 92.595 52.883
ATOM 1777 O GLY 1116 92.155 51.840
ATOM 1778 N SER 1117 92.027 54.071
ATOM 1779 CA SER 1117 90.834
ATOM 1780 CB SER 1117 90.688
ATOM 1781 OG SER 1117 90.524
ATOM 1782 C SER 1117 89.567
ATOM 1783 O SER 1117 89.592
ATOM 1784 N ARG 1118 88.468
ATOM 1785 CA ARG 1118 87.170
ATOM 1786 CB ARG 1118 86.635
ATOM 1787 CG ARG 1118 87.368
ATOM 1788 CD ARG 1118 87.604
ATOM 1789 NE ARG 1118 87.874
ATOM 1790 CZ ARG 1118 88.830
ATOM 1791 NH1 ARG 1118 88.981
ATOM 1792 NH2 ARG 1118 89.661
ATOM 1793 C ARG 1118 86.252
ATOM 1794 O ARG 1118 86.375
ATOM 1795 N SER 1119 85.355
ATOM 1796 CA SER 1119 84.446
ATOM 1797 CB SER 1119 84.333
ATOM 1798 OG SER 1119 85.535
ATOM 1799 C SER 1119 83.053
ATOM 1800 O SER 1119 82.620
ATOM 1801 N LEU 1120 82.379
ATOM 1802 CA LEU 1120 81.008
ATOM 1803 CB LEU 1120 80.691
ATOM 1804 CG LEU 1120 81.329
ATOM 1805 CD1 LEU 1120 80.978
ATOM 1806 CD2 LEU 1120 80.854
ATOM 1807 C LEU 1120 80.197
ATOM 1808 O LEU 1120 80.753
ATOM 1809 N THR 1121 78.893
ATOM 1810 CA THR 1121 78.012
ATOM 1811 CB THR 1121 77.383
ATOM 1812 OG1 THR 1121 77.011
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1197 




73.360 69.780 38.748 1.00 21.74
ATOM 2355 C PRO 1197 75.444 67.751 36.557 1.00 24.76
ATOM 2356 O PRO 1197 76.143 68.606 36.011 1.00 25.57
ATOM 2357 N VAL 1198 75.267 66.537 36.044 1.00 26.89
ATOM 2358 CA VAL 1198 75.923 66.159 34.797 1.00 28.24
ATOM 2359 CB VAL 1198 75.541 64.746 34.338 1.00 27.91
ATOM 2360 CG1 VAL 1198 76.318 64.392 33.091 1.00 32.68
ATOM 2361 CG2 VAL 1198 75.869 63.747 35.401 1.00 28.52
ATOM 2362 C VAL 1198 75.583 67.148 33.684 1.00 29.68
ATOM 2363 O VAL 1198 76.397 67.390 32.794 1.00 31.51
ATOM 2364 N ALA 1199 74.394 67.741 33.754 1.00 30.96
ATOM 2365 CA ALA 1199 73.954 68.718 32.759 1.00 28.72
ATOM 2366 CB ALA 1199 72.610 69.294 33.157 1.00 30.86
ATOM 2367 C ALA 1199 74.970 69.838 32.603 1.00 27.35
ATOM 2368 O ALA 1199 75.193 70.346 31.503 1.00 27.43
ATOM 2369 N ASN 1200 75.593 70.223 33.707 1.00 25.52
ATOM 2370 CA ASN 1200 76.586 71.280 33.671 1.00 25.79
ATOM 2371 CB ASN 1200 76.951 71.723 35.084 1.00 25.77
ATOM 2372 CG ASN 1200 75.810 72.442 35.789 1.00 27.58
ATOM 2373 OD1 ASN 1200 76.015 73.075 36.828 1.00 29.86
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ATOM 2374 ND2 ASN 1200 74.602 72.337 35.246 1.00 25.16 

ATOM 2375 C ASN 1200 77.828 70.842 32.907 1.00 26.50

ATOM 2376 O ASN 1200 78.544 71.681 32.358 1.00 25.46

ATOM 2377 N LEU 1201 78.063 69.531 32.844 1.00 27.79

ATOM 2378 CA LEU 1201 79.213 68.977 32.125 1.00 27.44

ATOM 2379 CB LEU 1201 79.400 67.488 32.437 1.00 27.76

ATOM 2380 CG LEU 1201 79.560 67.059 33.897 1.00 27.52

ATOM 2381 CD1 LEU 1201 79.793 65.564 33.955 1.00 28.46

ATOM 2382 CD2 LEU 1201 80.703 67.790 34.548 1.00 24.26

ATOM 2383 C LEU 1201 79.021 69.163 30.627 1.00 26.96

ATOM 2384 O LEU 1201 79.891 69.694 29.946 1.00 26.74

ATOM 2385 N GLU 1202 77.864 68.752 30.125 1.00 27.76

ATOM 2386 CA GLU 1202 77.555 68.886 28.703 1.00 30.15

ATOM 2387 CB GLU 1202 76.207 68.237 28.404 1.00 35.17

ATOM 2388 CG GLU 1202 75.915 68.080 26.912 1.00 42.07

ATOM 2389 CD GLU 1202 76.742 66.985 26.247 1.00 45.05

ATOM 2390 OE1 GLU 1202 76.867 65.882 26.837 1.00 45.05

ATOM 2391 OE2 GLU 1202 77.253 67.232 25.129 1.00 48.05

ATOM 2392 C GLU 1202 77.530 70.362 28.311 1.00 27.55

ATOM 2393 O GLU 1202 77.988 70.746 27.239 1.00 26.78

ATOM 2394 N THR 1203 76.989 71.182 29.200 1.00 27.54

ATOM 2395 CA THR 1203 76.916 72.618 28.997 1.00 28.52

ATOM 2396 CB THR 1203 76.183 73.278 30.168 1.00 29.70

ATOM 2397 OG1 THR 1203 74.814 72.848 30.179 1.00 31.63

ATOM 2398 CG2 THR 1203 76.248 74.787 30.066 1.00 30.94

ATOM 2399 C THR 1203 78.336 73.175 28.923 1.00 30.35

ATOM 2400 O THR 1203 78.630 74.038 28.099 1.00 31.87

ATOM 2401 N THR 1204 79.204 72.693 29.810 1.00 30.89

ATOM 2402 CA THR 1204 80.597 73.126 29.839 1.00 30.06 

ATOM 2403 CB THR 1204 81.325 72.545 31.070 1.00 27.84 

ATOM 2404 OG1 THR 1204 80.789 73.148 32.254 1.00 28.92 

ATOM 2405 CG2 THR 1204 82.823 72.804 31.010 1.00 26.94 

ATOM 2406 C THR 1204 81.329 72.732 28.551 1.00 31.06 

ATOM 2407 O THR 1204 82.238 73.435 28.113 1.00 31.97 

ATOM 2408 N MET 1205 80.918 71.622 27.940 1.00 30.02 

ATOM 2409 CA MET 1205 81.545 71.155 26.710 1.00 29.36 

ATOM 2410 CB MET 1205 81.286 69.670 26.503 1.00 26.89 

ATOM 2411 CG MET 1205 81.846 68.816 27.605 1.00 26.79 

ATOM 2412 SD MET 1205 81.869 67.096 27.164 1.00 25.34 

ATOM 2413 CE MET 1205 80.165 66.684 27.336 1.00 26.80 

ATOM 2414 C MET 1205 81.101 71.927 25.480 1.00 30.12 

ATOM 2415 O MET 1205 81.795 71.910 24.463 1.00 31.11

ATOM 2416 N ARG 1206 79.944 72.582 25.560 1.00 32.04 

ATOM 2417 CA ARG 1206 79.425 73.372 24.443 1.00 32.33 

ATOM 2418 CB ARG 1206 77.898 73.466 24.509 1.00 33.91

ATOM 2419 CG ARG 1206 77.180 72.147 24.315 1.00 36.45

ATOM 2420 CD ARG 1206 75.689 72.368 24.083 1.00 39.82

ATOM 2421 NE ARG 1206 74.994 72.858 25.274 1.00 44.02

ATOM 2422 CZ ARG 1206 74.337 72.082 26.140 1.00 44.21

ATOM 2423 NH1 ARG 1206 74.272 70.767 25.962 1.00 41.77

ATOM 2424 NH2 ARG 1206 73.760 72.623 27.207 1.00 44.77
ATOM 2425 C ARG 1206 80.030 74.780 24.404 1.00 31.71
ATOM 2426 O ARG 1206 80.163 75.405 25.487 1.00 33.75

FIGURE 3 (

SNS4A COORDINATES (Complex B)



Atom Type Resid # X Y Z OCC B
ATOM 2535 CB LYS 1677 101.069 76.195 38.823 1.00 42.61
ATOM 2536 CG LYS 1677 100.696 76.412 40.284 1.00 45.42
ATOM 2537 CD LYS 1677 99.373 77.170 40.435 1.00 48.57
ATOM 2538 CE LYS 1677 99.221 77.735 41.859 1.00 50.16
ATOM 2539 NZ LYS 1677 97.898 78.400 42.112 1.00 51.22
ATOM 2540 C LYS 1677 102.635 74.294 38.331 1.00 39.53
ATOM 2541 O LYS 1677 102.747 73.868 37.171 1.00 39.44
ATOM 2542 N LYS 1677 103.162 76.602 37.521 1.00 40.63
ATOM 2543 CA LYS 1677 102.536 75.793 38.615 1.00 40.89
ATOM 2544 N GLY 1678 102.595 73.497 39.393 1.00 37.81
ATOM 2545 CA GLY 1678 102.708 72.057 39.240 1.00 34.76
ATOM 2546 C GLY 1678 101.509 71.417 38.576 1.00 31.90
ATOM 2547 O GLY 1678 100.461 72.045 38.430 1.00 31.54
ATOM 2548 N SER 1679 101.684 70.181 38.126 1.00 28.52
ATOM 2549 CA SER 1679 100.606 69.443 37.497 1.00 25.57
ATOM 2550 CB SER 1679 101.167 68.417 36.506 1.00 26.14
ATOM 2551 OG SER 1679 101.609 69.016 35.303 1.00 24.59
ATOM 2552 C SER 1679 99.807 68.703 38.562 1.00 24.40
ATOM 2553 O SER 1679 100.280 68.506 39.688 1.00 24.38
ATOM 2554 N VAL 1680 98.574 68.346 38.219 1.00 23.52
ATOM 2555 CA VAL 1680 97.724 67.576 39.106 1.00 21.39
ATOM 2556 CB VAL 1680 96.252 67.694 38.697 1.00 21.22
ATOM 2557 CG1 VAL 1680 95.392 66.728 39.493 1.00 19.38
ATOM 2558 CG2 VAL 1680 95.784 69.112 38.932 1.00 21.01
ATOM 2559 C VAL 1680 98.230 66.158 38.894 1.00 20.14
ATOM 2560 O VAL 1680 98.478 65.754 37.754 1.00 18.28
ATOM 2561 N VAL 1681 98.463 65.442 39.990 1.00 20.40
ATOM 2562 CA VAL 1681 98.990 64.085 39.922 1.00 20.62
ATOM 2563 CB VAL 1681 100.343 63.971 40.678 1.00 18.90
ATOM 2564 CG1 VAL 1681 100.968 62.616 40.454 1.00 19.40
ATOM 2565 CG2 VAL 1681 101.283 65.039 40.231 1.00 20.79
ATOM 2566 C VAL 1681 98.050 63.055 40.522 1.00 21.25
ATOM 2567 O VAL 1681 97.459 63.277 41.584 1.00 19.23
ATOM 2568 N ILE 1682 97.956 61.910 39.850 1.00 22.25
ATOM 2569 CA ILE 1682 97.134 60.795 40.304 1.00 22.27
ATOM 2570 CB ILE 1682 96.791 59.853 39.125 1.00 18.88
ATOM 2571 CG2 ILE 1682 96.052 58.633 39.627 1.00 19.14
ATOM 2572 CG1 ILE 1682 95.940 60.597 38.094 1.00 17.33
ATOM 2573 CD1 ILE 1682 95.611 59.800 36.862 1.00 14.03
ATOM 2574 C ILE 1682 97.926 60.028 41.376 1.00 23.98
ATOM 2575 O ILE 1682 98.998 59.474 41.090 1.00 24.49
ATOM 2576 N VAL 1683 97.425 60.045 42.611 1.00 24.04
ATOM 2577 CA VAL 1683 98.074 59.345 43.724 1.00 22.87
ATOM 2578 CB VAL 1683 98.294 60.273 44.960 1.00 20.49
ATOM 2579 CG1 VAL 1683 99.218 61.422 44.602 1.00 21.14
ATOM 2580 CG2 VAL 1683 96.986 60.787 45.493 1.00 15.48
ATOM 2581 C VAL 1683 97.319 58.086 44.177 1.00 22.99
ATOM 2582 O VAL 1683 97.667 57.470 45.190 1.00 25.32
ATOM 2583 N GLY 1684 96.294 57.704 43.423 1.00 22.53
ATOM 2584 CA GLY 1684 95.515 56.527 43.757 1.00 21.02
ATOM 2585 C GLY 1684 94.291 56.452 42.870 1.00 21.67
ATOM 2586 O GLY 1684 94.144 57.267 41.956 1.00 21.86
ATOM 2587 N ARG 1685 93.399 55.507 43.154 1.00 22.35
ATOM 2588 CA ARG 1685 92.177 55.327 42.369 1.00 19.90
ATOM 2589 CB ARG 1685 92.449 54.463 41.150 1.00 20.04
ATOM 2590 CG ARG 1685 93.003 53.105 41.516 1.00 22.48
ATOM 2591 CD ARG 1685 93.157 52.240 40.296 1.00 26.67
ATOM 2592 NE ARG 1685 94.237 51.27| 40.446 1.00 28.97
ATOM 2593 CZ ARG 1685 95.485 51.48J 40.048 1.00 27.79
ATOM 2594 NH1 ARG 1685 95.815 52.631 39.474 1.00 27.86
ATOM 2595 NH2 ARG 1685 96.404 50.557 40.224 1.00 29.00
ATOM 2596 C ARG 1685 91.113 54.636 43.185 1.00 20.01
ATOM 2597 O ARG 1685 91.404 54.035 44.218 1.00 22.88
ATOM 2598 N ILE 1686 89.882 54.703 42.697 1.00 18.51
ATOM 2599 CA ILE 1686 88.745 54.066 43.346 1.00 17.38
ATOM 2600 CB ILE 1686 87.823 55.120 43.985 1.00 15.63
ATOM 2601 CG2 ILE 1686 86.464 54.533 44.292 1.00 14.38
ATOM 2602 CG1 ILE 1686 88.496 55.665 45.251 1.00 14.48
ATOM 2603 CD1 ILE 1686 87.740 56.758 45.935 1.00 14.17
ATOM 2604 C ILE 1686 88.043 53.247 42.268 1.00 17.71
ATOM 2605 O ILE 1686 87.699 53.769 41.205 1.00 17.73
ATOM 2606 N VAL 1687 87.945 51.940 42.484 1.00 16.52
ATOM 2607 CA VAL 1687 87.325 51.073 41.493 1.00 14.59
ATOM 2608 CB VAL 1687 88.152 49.810 41.237 1.00 13.39
ATOM 2609 CG1 VAL 1687 87.509 48.980 40.145 1.00 11.21
ATOM 2610 CG2 VAL 1687 89.564 50.184 40.842 1.00 11.10
ATOM 2611 C VAL 1687 85.922 50.681 41.874 1.00 15.40
ATOM 2612 O VAL 1687 85.695 50.067 42.916 1.00 17.30
ATOM 2613 N LEU 1688 84.990 51.081 41.016 1.00 16.16
ATOM 2614 CA LEU 1688 83.566 50.827 41.171 1.00 15.94
ATOM 2615 CB LEU 1688 82.792 52.003 40.578 1.00 14.08
ATOM 2616 CG LEU 1688 82.936 53.300 41.365 1.00 15.70
ATOM 2617 CD1 LEU 1688 82.611 54.511 40.519 1.00 15.92
ATOM 2618 CD2 LEU 1688 82.026 53.212 42.559 1.00 18.17
ATOM 2619 C LEU 1688 83.129 49.532 40.481 1.00 17.97
ATOM 2620 O LEU 1688 82.209 48.856 40.939 1.00 20.09
ATOM 2621 N SER 1689 83.826 49.174 39.406 1.00 20.02
ATOM 2622 CA SER 1689 83.518 47.982 38.611 1.00 21.75
ATOM 2623 CB SER 1689 84.253 48.060 37.266 1.00 23.96
ATOM 2624 OG SER 1689 85.605 48.467 37.436 1.00 27.29
ATOM 2625 C SER 1689 83.749 46.603 39.243 1.00 21.43
ATOM 2626 O SER 1689 83.290 45.596 38.700 1.00 23.26
ATOM 2627 N GLY 1690 84.465 46.547 40.364 1.00 21.24
ATOM 2628 CA GLY 1690 724 45.277 41.015 1.00 19.68
ATOM 2629 C GLY 1690 83.431 44.657 41.500 1.00 21.01
ATOM 2630 O GLY 1690 82.452 45.367 41.741 1.00 20.02
ATOM 2631 N LYS 1691 83.419 43.334 41.618 1.00 22.40
ATOM 2632 CA LYS 1691 82.245 42.596 42.075 1.00 25.78
ATOM 2633 CB LYS 1691 81.621 41.838 40.901 1.00 30.22
ATOM 2634 CG LYS 1691 81.068 42.740 39.813 1.00 38.96
ATOM 2635 CD LYS 1691 80.651 41.947 38.575 1.00 46.14
ATOM 2636 CE LYS 1691 80.270 42.880 37.405 1.00 49.42
ATOM 2637 NZ LYS 1691 79.814 42.127 36.186 1.00 50.87
ATOM 2638 C LYS 1691 82.705 41.610 43.142 1.00 25.02
ATOM 2639 O LYS 1691 83.885 41.254 43.176 1.00 26.26
ATOM 2640 N PRO 1692 81.796 41.167 44.031 1.00 23.86
ATOM 2641 CD PRO 1692 80.365 41.516 44.115 1.00 §5,36
ATOM 2642 CA PRO 1692 82.157 40.217 45.088 1.00 23.51
ATOM 2643 CB PRO 1692 80.801 39.691 45.542 1.00 23.49
ATOM 2644 CG PRO 1692 79.954 40.911 45.451 1.00 24.91
ATOM 2645 C PRO 1692 83.027 39.095 44.548 1.00 23.24
ATOM 2646 O PRO 1692 82.724 38.506 43.515 1.00 25.88
ATOM 2647 N ALA 1693 84.136 38.831 45.219 1.00 21.89
ATOM 2648 CA ALA 1693 85.035 37.787 44.780 1.00 21.26
ATOM 2649 CB ALA 1693 86.173 38.383 43.989 1.00 21.74
ATOM 2650 C ALA 1693 85.568 37.089 46.000 1.00 <?2.42
ATOM 2651 O ALA 1693 85.708 37.705 47.055 1.00 24.98
ATOM 2652 N ILE 1694 85.810 35.791 45.879 1.00 22.49
ATOM 2653 CA ILE 1694 86.342 35.020 46.989 1.00 2M.65
ATOM 2654 CB ILE 1694 86.052 33.524 46.794 1.00 20.10
ATOM 2655 CG2 ILE 1694 86.718 32.714 47.873 1.00 22.22
ATOM 2656 CG1 ILE 1694 84.539 33.293 46.829 1.00 20.16
ATOM 2657 CD1 ILE 1694 84.133 31.872 46.593 1.00 19.23
ATOM 2658 C ILE 1694 87.832 35.311 46.999 1.00 22.06
ATOM 2659 O ILE 1694 88.506 35.118 45.988 1.00 25.81
ATOM 2660 N ILE 1695 88.336 35.837 48.109 1.00 21.38
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